Goto

Collaborating Authors

 spatiotemporal frequency channel


1baff70e2669e8376347efd3a874a341-Reviews.html

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. COMMENTS BASED ON REVIEWER DISCUSSIONS AND AUTHOR REBUTTAL: I agree with the other reviewers that more could be done to constrain the specifics of the cue integration mechanism. However, I believe that if the data set is expanded, allowing the models to be better constrained, then the paper is appropriate and interesting for the NIPS community. I have left my quality score as it was, but I agree with the other reviewers that the paper merits a ``1'' rather than a ``2'' for impact score. ORIGINAL REVIEW: Summary: This paper extends an existing model for the perception of visual speed that uses a Bayesian observer model acting on the activity of independent spatiotemporal frequency channels. Previously, the model accounted for illusions of perceived speed by postulating the Bayes-optimal combination of noisy sensory representations with a prior for slow speeds.


Optimal integration of visual speed across different spatiotemporal frequency channels

Neural Information Processing Systems

How do humans perceive the speed of a coherent motion stimulus that contains motion energy in multiple spatiotemporal frequency bands? Here we tested the idea that perceived speed is the result of an integration process that optimally combines speed information across independent spatiotemporal frequency channels. We formalized this hypothesis with a Bayesian observer model that combines the likelihood functions provided by the individual channel responses (cues). We experimentally validated the model with a 2AFC speed discrimination experiment that measured subjects' perceived speed of drifting sinusoidal gratings with different contrasts and spatial frequencies, and of various combinations of these single gratings. We found that the perceived speeds of the combined stimuli are independent of the relative phase of the underlying grating components. The results also show that the discrimination thresholds are smaller for the combined stimuli than for the individual grating components, supporting the cue combination hypothesis. The proposed Bayesian model fits the data well, accounting for the full psychometric functions of both simple and combined stimuli. Fits are improved if we assume that the channel responses are subject to divisive normalization. Our results provide an important step toward a more complete model of visual motion perception that can predict perceived speeds for coherent motion stimuli of arbitrary spatial structure.


Optimal integration of visual speed across different spatiotemporal frequency channels

Neural Information Processing Systems

How does the human visual system compute the speed of a coherent motion stimulus that contains motion energy in different spatiotemporal frequency bands? Here we propose that perceived speed is the result of optimal integration of speed information from independent spatiotemporal frequency tuned channels. We formalize this hypothesis with a Bayesian observer model that treats the channel activity as independent cues, which are optimally combined with a prior expectation for slow speeds. We test the model against behavioral data from a 2AFC speed discrimination task with which we measured subjects' perceived speed of drifting sinusoidal gratings with different contrasts and spatial frequencies, and of various combinations of these single gratings. We find that perceived speed of the combined stimuli is independent of the relative phase of the underlying grating components, and that the perceptual biases and discrimination thresholds are always smaller for the combined stimuli, supporting the cue combination hypothesis. The proposed Bayesian model fits the data well, accounting for perceptual biases and thresholds of both simple and combined stimuli. Fits are improved if we assume that the channel responses are subject to divisive normalization, which is in line with physiological evidence. Our results provide an important step toward a more complete model of visual motion perception that can predict perceived speeds for stimuli of arbitrary spatial structure.